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METHOD OF MULTIPLE-ANTENNA WIRELESS COMMUNICATION 

USING SPACE-TIME CODES 

Field of the Invention 

This invention relates to wireless radiofrequency communication, and more 
specifically, to methods of signal modulation for communication using multiple- 
5 antenna arrays. 

Art Background 

In wireless communication, certain advantages are offered by the use of 

multiple antenna elements for transmission, whether with one or with more than one 
10 receiving antenna element. These advantages include the potential to mitigate fading 

effects, and the potential to increase data transmission rates in a propagation channel 

of given characteristics. 

A variety of schemes have been proposed for modulating data to be 

transmitted from a multiple-element array. In some of these schemes, referred to 
15 generally as space-time modulation, the data are transmitted in the form of codewords 

distributed in space — i.e., across the antenna array — and in time. Such a codeword 

comprises a plurality of complex-valued amplitudes modulated onto a carrier wave. 
Within a given time interval, referred to as a symbol interval, a complex 

amplitude (which might be zero) is transmitted from each element of the antenna 
20 array. Conversely, at each element of the array, a sequence of amplitudes is 

transmitted over a succession of symbol intervals. The concurrent transmission of 

amplitudes from the elements of the array during one symbol interval is referred to as 

a channel use. 

A codeword of the kind described above can be represented by a matrix. The 
25 respective entries of the matrix are proportional to the complex amplitudes to be 

transmitted. Each column of the matrix corresponds, e.g., to a respective transmitting 
antenna, and each row corresponds, e.g., to a respective symbol interval. 

A variety of schemes have also been proposed for recovering the transmitted 
data from signals received by a single receiving antenna or a multiple-element 
30 receiving antenna array. Mathematical models of the propagation channel between 
the transmitting and receiving antennas generally include a matrix of channel 
coefficients, each such coefficient relating the amplitude received at a given element 
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of the receiving array to the amplitude transmitted from a given element of the 
transmitting array. In some of the known reception schemes, the channel coefficients 
are assumed to be known, exemplarily from measurements made using pilot signals. 
When the channel coefficients are known, methods of signal recovery can be 
5 used that effectively invert the channel matrix. Both direct and indirect methods are 
known for effectively inverting the channel matrix. Among the indirect methods are 
Maximum Likelihood (ML) detectors. Given an estimate of the channel matrix and a 
received signal, an ML detector computes a likelihood score for each of a plurality of 
candidate codewords, and selects that candidate codeword that yields the highest 
10 score. Because of noise and uncertainties in the channel coefficients due to fading, 
received signals are generally corrupted to a greater or lesser extent. Thus, it is 
fn advantageous to use codewords for which the likelihood scores have high 

"O discriminating power, even in the presence of fading and noise. 

'■Sis? 

Q One known method of space time modulation is V-BLAST. In V-BLAST, an 

^ 15 initial stream of data is apportioned into separate sequences of amplitudes, each of 

JS which is independently transmitted from one of the transmitting antenna elements. In 

effect, the codeword can be represented by a row vector having M entries, where M is 
H the number of transmitting antennas. The single row represents a single symbol 

hk interval. Typically, a new codeword is transmitted in each symbol interval. The 

20 independent sequence of amplitudes transmitted by each antenna can be referred to as 
H ! a substream because it contains a respective subset of the data in the initial data 

stream. 

Several schemes have been described for recovering V-BLAST signals. Some 
such schemes use ML detectors. According to another such scheme, the entries of the 

25 transmitted vector are recovered one-by-one, with each successive recovery utilizing 
the results of the previous recoveries. One example of such a scheme is described in 
the co-pending U.S. patent application Serial No. 09/438,900, filed November 12, 
1 999 by B. Hassibi under the title "Method and Apparatus for Receiving Wireless 
Transmissions Using Multiple- Antenna Arrays," and commonly assigned herewith. 

30 V-BLAST is advantageous in that it can be used for communication at 

relatively high data rates without excessive computational complexity in the decoding 
of the received signals. However, the decoding schemes that offer the lowest 
complexity require that the number N of receiving antennas must equal or exceed the 
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number M of transmitting antennas. Such a requirement is disadvantageous when, for 
example, a large installation such as a base station is transmitting to a small 
installation such as a hand-held mobile wireless terminal. 

Another method of space time modulation is described in S.M. Alamouti, "A 
5 simple transmitter diversity scheme for wireless communications," IEEE J. SeL Area 
Comm. (October 1998) 1451-1458. In the Alamouti scheme, each codeword is 
distributed over two transmit antennas and two symbol intervals. Each codeword is 
determined by two distinct complex amplitudes, each belonging to a respective 
substream. In the first symbol interval, one of the amplitudes is transmitted from the 
10 first antenna, and the other amplitude is transmitted from the second antenna. In the 
second symbol interval, the complex amplitudes are interchanged between the two 
antennas, one of the complex amplitudes changes sign, and the complex conjugates of 
the resulting amplitudes are transmitted. Significantly, when a codeword of this kind 
□ is expressed in the form of a matrix, the matrix has orthogonal columns. 

l Z 15 One drawback of the Alamouti scheme is that it makes the most efficient use 

Si 

45 of the theoretical information capacity of the propagation channel only when there is a 

" Ss i 

single receiving antenna. The channel capacity is used less efficiently when further 

J J receiving antennas are added. Thus, gains that might otherwise be expected in data 

iMb rate and fading resistance from multiple-antenna receiving arrays are not fully 

7 i~JL 

Q 20 realized. 

Extensions of the Alamouti scheme to more than two transmitting antennas 
and more than two symbol intervals per codeword are also known. The Alamouti 
scheme and its extensions are referred to generally as orthogonal designs because the 
matrices that represent the codewords are required to be orthogonal; that is, each 
25 column of such a matrix is orthogonal to every other column of the matrix. A further 
requirement of orthogonal designs is that for a matrix to represent a codeword, all 
columns of the matrix must have the same energy. In this regard, the "energy" of a 
column is the scalar product of that column with its complex conjugate. 

Until now, there has been an unmet need for a space-time modulation scheme 
30 that can handle high data rates with relatively low decoding complexity and that uses 
the potentially available channel capacity with relatively high efficiency for any 
combination (M 9 N) of transmission and reception antennas. 
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Summary of the Invention 

We have invented such a scheme. Our scheme uses space-time matrices to 
spread the transmission of data over two or more transmit antennas and/or over two or 
more symbol intervals. ( A "matrix" in this regard may consist of a single column or 

5 a single row.) Initially, blocks of data are encoded as complex amplitudes selected 
from a finite set of such amplitudes. (Complex values include those that are pure real 
and pure imaginary.) We refer to each selected complex amplitude as a "symbol," 
and we refer to the finite set as a "constellation." If the constellation has r elements, 
then each symbol carries log 2 r bits of information. 

10 The constellation is predetermined, and also a fixed, finite set of space-time 

matrices is predetermined. We refer to the matrices in this set as "dispersion 
matrices." In the following discussion of an exemplary embodiment of the invention, 
we let Q represent the number of dispersion matrices in the fixed, finite set. 

In transmission, Q symbols are transmitted concurrently. Each of the Q 

15 symbols to be transmitted is multiplied by a respective dispersion matrix. A 

composite matrix, proportional to the sum of all Q dispersion matrices multiplied by 
their corresponding symbols, is transmitted according to the principles of space-time 
modulation described above. 

The elements of the dispersion matrices are advantageously selected according 

20 to a procedure that seeks to drive the rate at which data can be sent and received 
toward the information-theoretic channel capacity. 

We have found that orthogonal designs fall significantly short of achieving the 
information-theoretic capacity of the channel whenever there are more than two 
transmitting antennas or more than one receiving antenna. 

25 In contrast to transmission methods using orthogonal designs, our method 

does not constrain the columns of the transmitted, composite matrix to be orthogonal 
or to have equal energies. 

In reception, knowledge of the dispersion matrices is used to recover the Q 
symbols from the received signals corresponding to the composite matrix that was 

30 transmitted. 

In a broader aspect of the invention, the total number of dispersion matrices is 
2Q. Half of the dispersion matrices are used to spread the real parts of the Q symbols, 
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and the other half are used to spread the imaginary parts of the Q symbols. 
Alternatively, half of the dispersion matrices are used to spread the Q symbols, and 
the other half are used to spread the complex conjugates of the Q symbols. 

5 Brief Description of the Drawing 

FIG. 1 is a conceptual drawing of a multiple-antenna wireless communication 
system of the prior art. 

FIG. 2 is a conceptual diagram showing the operation of linear dispersion 
coding according to the invention in some embodiments. 
10 FIG. 3 is a flowchart illustrating the application of a criterion on the 

maximality of mutual information for designing a linear dispersion code according to 
the invention in some embodiments. 

FIG. 4 is a graph showing the theoretical performance of certain coding 
techniques in terms of bit-error rate (ber) or block-error rate (bier) as a function of 
15 signal-to-noise ratio (SNR). Compared in the figure are the results of theoretical 
modeling of a linear dispersion code (solid curves labeled "LD Code") and an 
orthogonal design (broken curves labeled "OD"). 

Detailed Description 

20 Certain general features of space-time modulation will now be described with 

reference to FIG. 1. Let there be M transmit antennas 10.1-10.M, and N receive 
antennasl5.1-15.M. Let the propagation channel be reasonably well modeled as a 
narrow-band, flat-fading channel that is effectively constant and known to the receiver 
for a duration whose length is at least T symbol intervals. The transmitted signal can 

25 then be written as a T x M matrix S that governs the transmission over the M antennas 
during the interval. 

Illustrated schematically in FIG. 1 is the transmission of the first row of the 
signal matrix S. During the first of T symbol intervals, the complex amplitude S u is 
modulated onto a radiofrequency carrier and transmitted from antenna 10.1, and each 

30 of the remaining complex amplitudes S n ,... 9 S lM is modulated onto the carrier and 
transmitted from a corresponding antenna 10.2, . . . , 10.M 
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At the receiving end, all of the transmitted amplitudes are intercepted by each 
of the receiving antennas 15.1-15.N, with varying attenuations and phase delays 
determined by the characteristics of the propagation channel, which is described by 
the matrix H of channel coefficients. Thus, after demodulation to baseband, the signal 
5 from each receiving antenna resulting from each channel use is a linear combination 
of the amplitudes S n , . . . , S lM , with complex weights determined by the propagation 
channel, plus additive noise. The outputs over T symbol intervals, corresponding to 
the response of the receiver to the transmission of matrix S, can be represented as a 
T x N matrix X+ F, where X contains the linear combinations described above, and V 
10 contains the additive noise. Illustrated schematically in FIG. 1 is the receipt of the first 
row of matrix X+ V. 

Certain features of the present invention will now be described with reference 
to FIG. 2. The steps shown in FIG. 2 are merely illustrative. Those skilled in the art 
will appreciate that numerous alternative procedures will bring about equivalent 
15 results, and thus fall within the scope and spirit of the present invention. 

A sequence 20 of data, exemplarily a binary sequence of O's and l's, is parsed 
into substreams. In the example shown, the number Q of substreams is 3, and each 
block of data in a substream carries three bits of information. A block 25 of data from 
each substream is mapped to a symbol 30 selected from constellation 35. The 
20 illustrative constellation shown in FIG. 2 is a set of eight uniformly spaced points on 
the unit circle in the complex plane. More typically, the constellation will be an r- 
PSK or r-QAM constellation. 

In the example shown, the image of each block of data is a respective one of 

the symbols s l9 s 29 s 3 . Each of these symbols directly multiplies a respective 
25 dispersion matrix A^A^A^. In process 40, the complex conjugate is taken of each 
symbol, thus generating a further symbol. Each of the resulting complex conjugates 
multiplies a respective dispersion matrix B l9 B 2 ,B 3 . In process 45, which is 
represented in the figure as a summation element, the signal matrix S is constructed 
by summing the six dispersion matrices, with each weighted by its corresponding 
30 symbol. 

More generally, Q symbols are selected from an appropriate 

constellation. The signal matrix S is constructed according to: 
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where 



s q =a q +j/3 q9 ? = U& (2) 
We refer to a code of this kind as a rate R=(Q/T)\og2T linear dispersion (LD) 

code. 

The code is completely specified by the fixed T x M complex matrices A h ...,A Q and 
B\ ,. . ., B Q , which we refer to as dispersion matrices. Each individual codeword is 
determined by the scalars {s\ 9 ... 9 s Q }. 

Alternatively, S is expressed by: 

Q 



where the C a and i>_ are the fixed TxM dispersion matrices. 

In specific implementations, one or more of the A q or B q , or one or more of 
the C or D q , matrices could be zero. In fact, it is essential only that there be at least 

Q non-zero dispersion matrices. 
15 As noted above, in a narrow-band, flat-fading, multi-antenna communication 

system with M transmit and AT receive antennas, the transmitted and received signals 
are related by a linear relationship. We here represent that relationship by: 

x =(^ Hs+ °> (4) 

20 where the complex JV-dimensional vector x denotes the vector of complex received 
signals during any given channel use, the complex M-dimensional vector s denotes 
the vector of complex transmitted signals, the complex NxM matrix H denotes the 
channel matrix, and the complex iV-dimensional vector v denotes additive noise 
which, for purposes of theoretical analysis, is assumed to be spatially and temporally 

25 white; i.e., to be CN (0,1) (zero-mean, unit-variance, complex-Gaussian) distributed. 
For analytical purposes, the channel matrix if and transmitted vector s are assumed to 
have unit variance entries, implying that E tr HH * = MN and Es*s = M , where E 
( . ) denotes the statistical expected value. 
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Assuming that the quantities H, s, and u are random and independent, the 

normalization J— in Eq. (4) will insure that p is the signal-to-noise ratio (SNR) at 
V M 

the receiver independently of M. For analytical purposes, it is also often (although not 
invariably) assumed that the channel matrix H also has CN (0,1) entries. 
5 The entries of the channel matrix are assumed to be known to the receiver but 

not to the transmitter. This assumption is reasonable if training or pilot signals are 
sent to learn the channel, which is then constant for some coherence interval. The 
coherence interval of the channel is preferably large compared to M. 

When the channel is effectively constant for at least T channel uses we may 
10 write for each symbol interval /, 

x *=\T7 Hs <+°t> '=i>».,r, (5) 

V M 

so that defining X T - [x Y x 2 ...x T ],S T =[s r s 2 ...s T ]andV T =[o 1 u 2 ...o T ], we obtain 

X T = &HS T +V T . (6) 
VM 

15 

It is generally more convenient to write this equation in its transposed form 

X=^-SH+V, (7) 

where we have omitted the transpose notation from H and simply redefined this 
20 matrix to have dimension MxN. The complex TxN matrix X is the received 
signal, the complex TxM matrix S is the transmitted signal, and the complex 
TxN matrix V is the additive CN (0,1) noise. InX, 5, and V 9 time runs vertically 
and space runs horizontally. 

We note that, in general, the number of T x M matrices S needed in a 
25 codebook can be quite large. If the rate in bits/channel use is denoted R, then the 
number of matrices is 2 RT . For example, with M- 4 transmit and N= 2 receive 
antennas the channel capacity at p = 20 dB (with CN (0,1) distributed H) is more than 
12 bits/channel use. Even with a relatively small block size of T = 4, we need 2 48 « 
10 14 matrices at rate R = 12. 
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LD codes can readily generate the very large constellations that are needed. 
Moreover, because of their structure, they also allow efficient real-time decoding. 
Decoding. An important property of the LD codes is their linearity in the 

variables \a q ,f3 q } , leading to efficient decoding schemes such as those used in 

5 connection with V-BLAST. To see this, it is useful to write the block equation 



X = ^SH + V = ^±{a q A q+ jJ1 q B q )H + V (8) 

in a more convenient form. We decompose the matrices in Eq. (8) into their real and 
10 imaginary parts to obtain 

Denoting the columns of X R , X h H R , H h V R , and V t by and v ln , 

1 5 where n = 1, . . . , N , we form the single real system of equations 



X R,l 










X I,1 


\M 




+ 


°h\ 


X R,N 




a Q 






_ X I,N _ 




A. 







(10) 



where the equivalent 2NT x 2Q real channel matrix is given by 
20 H = 




We now introduce the following definitions: 
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(12) 



We have a linear relation between the input and output vectors s and x , 
respectively: 

[ P 



x = , 



-Hs + v, 



(13) 



5 where the equivalent channel H is known to the receiver because the original channel 
H, and the dispersion matrices {A Qi B q } are all known to the receiver. (Those skilled 
in the art will appreciate that an equivalent treatment can be formulated in terms of 
the dispersion matrices {C q ,D q } in place of the matrices {A q , B q }. The matrices 

{C 9 D q } are defined by Eq. (3), above.) 
10 The receiver simply uses Eq. (1 1) to find the equivalent channel. The system 

of equations between transmitter and receiver is not undetermined as long as 

Q<NT. 

We may therefore use any decoding technique already known for use, e.g., 
with V-BLAST, such as successive nulling and cancellation, its efficient square-root 
15 implementation, or sphere decoding. The most efficient implementations of these 
schemes generally require 0((f) computations and have roughly constant complexity 
in the size of the signal constellation r. Sphere decoding, which is an efficient species 
of maximum-likelihood decoding, will in at least some cases be particularly 
advantageous. 

20 Design of the dispersion matrices* In a broad sense, the mutual information 

between the input vector s and the output vector x is a measure of channel capacity 
as constrained by our definition of the "equivalent channel," and contingent on the 
choice of dispersion matrices. When maximized, the mutual information expresses 
the maximum data rate achievable through the use of linear dispersion codes as 

25 described here, for given values of Q and T and for given numbers of transmit and 
receive antennas. 
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For purposes of the exemplary design method to be described below, we now 
define the mutual information between the input vector s and the output vector x as 

— E logdetfL^ +— HH T ) , where E ( . ) denotes the statistical expected value, 
IT M 

I 2NT is the identity matrix of dimension 2NT , and H T is the transpose of the matrix 
5 H. 

As a general practice, we find it useful to take Q = min(M,N)T since this tends 
to maximize the mutual information between s and x while still having some coding 
effects. 

We choose {A q ,B q } to maximize the mutual information between S and x . 
10 We formalize the design criterion as follows. 



1 . Choose Q <NT (typically, Q = min(MiV> T). 

2. Choose {Aq, B q } that solve the optimization problem 



15 



C LD (p,T,M,N)= max — £logdet 
subject to one of the following constraints 



( p ~ V 

hNT+ — HH 

M 



(14) 



(ii) tr A* q A q = tr B q B q = ^ 9 q - 1, ...0 

20 (iii) AA=%B,=£l M9 q = UjQ 

where //is given by Eq. (1 1) with the entries of h R ,„ and h ln having independent N(0, J A) 
entries. (In our theoretical studies, we have assumed that the channel matrix H has 
independent C/V(0,1) entries. However, our mutual information criterion is also 
readily applied for designing linear dispersion codes appropriate to channels described 
25 by other statistical distributions.) 

The problem expressed by Eq. (14) can be solved subject to any of the 
constraints (i)-(iii). Constraint (i) is simply the power constraint of Eq. (8) that 
ensures E tr SIS'* = TM. Constraint (ii) is more restrictive and ensures that each of the 

n 
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transmitted signals a and f} q are transmitted with the same overall power from the M 
antennas during the T channel uses. Finally, constraint (iii) is the most stringent, 
since it forces the signals a q and fi q to be dispersed with equal energy in all spatial 

and temporal directions. 
5 We have empirically found that of two codes with similar mutual 

informations, the one satisfying the more stringent constraint performs better. 
The constraints (i)-(iii) are convex in the dispersion matrices {A q ,B q }. 

However, the cost function — E log det [ I 2NT +—HH T I is neither concave nor 

2T \ M J 

convex in the variables {A q ,B q }. Therefore, it is possible that Eq. (14) has local 

10 maxima. Nevertheless, we have been able to solve Eq. (14) with relative ease using 

gradient-based methods and it does not appear that local minima pose a great 

problem. 

The block length T is essentially also a design variable. Although it must be 
chosen shorter than the coherence time of the channel, it can be varied to help the 
15 optimization of Eq. (14). We have found that choosing M<T <2M often yields good 
performance. 

It should be noted that any code designed for a given number of receive 
antennas is also readily used for a greater number of receive antennas. 

With reference to FIG. 3, there is input at block 50 of that figure a statistical 

20 description of the propagation channel H. At block 60, the block length T, the size M 
of the transmit array, and the size N of the receiving array are provided to the 
processor. At block 65, the number Q is specified. At block 70, the optimization 
problem is solved to determine the set of 2Q dispersion matrices that maximizes the 
mutual information. At block 75 the processor outputs the dispersion matrices and the 

25 equivalent channel matrix H . At block 80, a calculated value of the mutual 
information is output by the processor. 

Example 

We will present an orthogonal design of block length T= 4 for M = 3 transmit 
30 antennas, and will then compare the orthogonal design to a linear dispersion code for 
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M= 3 transmit antennas and N = 1 receive antennas . The orthogonal design is 
written in terms of \a q j and {p q j as 



OC 2 + jfil a 3+J& 

-a 2 + j/3 2 a x -jp x 0 
-a 3 + j/3 3 0 

0 ~cc 3 +j& a 2 -ip 2 



(15) 



It turns out that this orthogonal design is also an LD code because, as we have 
found, it is a solution to Eq. (14) for T= 4 and Q = 3. It achieves a mutual 
information of 5.13 bits/channel use at p = 20 dB, whereas the channel capacity is 
6.41 bits/channel use. 

To find a better LD code, we first observe that it is advantageous for Q to obey 
the constraint Q<NT, with N=l and T=4. Therefore Q < 4, and we choose Q = 4. 
After optimizing (14) using a gradient-based search, we find: 



ffj + + j 



-«2 + a 4 



IT 



-j 



Pi , Pi~h 

12 2 



«2 _a 4 . 



A I2-I3 

^72 2 



. P 2 + h 



a 2 +a 4 

IT 



I2 2 



a 2 +a 4 

~ L T ± *J 
V2 

a l~ a 3 + J 



a 2 -a 4 



IT 



Px^hrh. 



-(*3+jP4 



a 2 +a 4 



6 



+j 



Pi Pi -Pi 

11 2 

p\ P2-P3 

12 2 



(16) 

This code has a mutual information of 6.25 bits/channel use at p = 20 dB, 
which is most of the channel capacity. Figure 4 compares the performance of the 
orthogonal design of Eq. (15) with the LD code of Eq. (16) at rate R = 6. (The rate of 
either code is (Q/T) log2r; we achieve R = 6 by having the orthogonal design send 
256-QAM, and the LD code send 64-QAM.) The decoding in both cases is the 
efficient form of nulling/cancelling described in U.S. Patent Application Serial No. 
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09/438,900. We see from Figure 4 that the LD code performs uniformly better, 
worth noting that the matrix S in Eq. (16) has orthogonal columns. 



Mathematical Details 

Normalization of the dispersion matrices. For purposes of theoretical 
analysis, we have assumed that the transmit signal S is normalized such that E tr SS* 

= TM. This induces the following normalization on the matrices {A q , B q j : 

Z(trAlA g +trBlB g ) = 2TM. 

Mathematical formulas for use in solving the optimization problem. In 

this section, we compute the gradient of the cost function of Eq. (14). To help 
compute this gradient, we rewrite the cost function in Eq. (14) as 



— £log det 
2T 



f 










T 






p Q 


Aq : 


■• 0 " 


"V 


'V 




kq : 


•• 0 " 




0 • 


■• A * 




% 




0 • 





















(17) 



where for q = 1, Q and n = 1, N, we have defined 



^q 



-4* 



-B 



R,q 



B 



St. 



(18) 



The subscript "R" denotes "real part", and "F denotes "imaginary part". 
Define the matrix appearing in the log det(.) of Eq. (17) as Z. That is, 
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z = 



P Q 
l2NT + M^l 



A q ... 0 



H, 



H 



N 



H 



A q - 0 



Define: 
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(19) 
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R q =E 
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[HI -Hi] 
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0 • 


■• K 


) 



(20) 



The gradients of the cost function / = E • log det Z are given by: 




q,i+(2n-2)T ,j+(2n-2)M + ^,/+(2«-l)f J+(2«-l)M ) ( 2 * ) 



~ Z-f \ U s /+(2w-l)r j+(2/i-2)M fg,i +(2»-2)r J+(2i 



/+(2ii-1)W ) ( 22 ) 



R 



q,i+(2n-l)T,j+{2n-2)M iv qj+(2n-2)TJ+(2n-l)M 



Z-f^ iV ^/+(2n-2)7'J+(2«-2)M T i ^,/+(2n-l)r,7+(2n-l)A/ ^ 



) (23) 



(24) 
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